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Abstract 

Food webs, networks of feeding relationships among organisms, provide fundamental insights into mecha- 
nisms that determine ecosystem stability and persistence. Despite long-standing interest in the compart- 
mental structure of food webs, past network analyses of food webs have been constrained by a standard 
definition of compartments, or modules, that requires many links within compartments and few links be- 
tween them. Empirical analyses have been further limited by low-resolution data for primary producers. 
In this paper, we present a Bayesian computational method for identifying group structure in food webs 
using a flexible definition of a group that can describe both functional roles and standard compartments. 
The Serengeti ecosystem provides an opportunity to examine structure in a newly compiled food web 
that includes species-level resolution among plants, allowing us to address whether groups in the food 
web correspond to tightly-connected compartments or functional groups, and whether network struc- 
ture reflects spatial or trophic organization, or a combination of the two. We have compiled the major 
mammalian and plant components of the Serengeti food web from published literature, and we infer its 
group structure using our method. We find that network structure corresponds to spatially distinct plant 
groups coupled at higher trophic levels by groups of herbivores, which are in turn coupled by carnivore 
groups. Thus the group structure of the Serengeti web represents a mixture of trophic guild structure 
and spatial patterns, in contrast to the standard compartments typically identified in ecological networks. 
From data consisting only of nodes and links, the group structure that emerges supports recent ideas on 
spatial coupling and energy channels in ecosystems that have been proposed as important for persistence. 
Our Bayesian approach provides a powerful, flexible framework for the study of network structure; we 
believe it will prove instrumental in a variety of biological contexts. 



Introduction 

Food webs, networks of feeding relationships in 
ecosystems, connect the biotic interactions among 
organisms with energy flows, thus linking together 
population dynamics, ecosystem function, and net- 
work topology. Ecologists have been using this pow- 
erful conceptual tool for more than a century [l}j3] . 
One element of structure of particular relevance to 
large food webs is the subdivision of species into 
compartments or groups, a feature that has been 
proposed to contribute to food web stability by con- 
straining the propagation of disturbances through a 



network [4] . Although a large literature has consid- 
ered the presence and dynamic significance of com- 
partments in food webs, on the whole, evidence for 
the importance of compartments has been inconclu- 
sive [5j|8]. In this literature, compartments are al- 
ternately referred to as modules, clusters, or "com- 
munities" (9], and are defined by high link density 
within groups and low link density between them. 

Recent work with a probabilistic model consid- 
ers a more flexible notion of groups, allowing link 
density to be high or low within any group or be- 
tween any pair of groups [8]. Groups can thus rep- 
resent compartments in the previous sense, but can 
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also represent trophic guilds or roles 10 11 , sets 



of species that feed on, and are fed on, by simi- 
lar sets of species. By fitting models of this type 
to data, the dominant topological pattern in the 
network can be found, which may include (spatial) 
compartments, or trophic guilds, or some combi- 
nation of the two. The initial application of this 
model to empirical food webs from different ecosys- 
tems has revealed a predominance of trophic guilds 
rather than compartments [8]. 

Two major challenges limit the application of 
this model in resolving the group structure of large 
food webs and interpreting its biological basis. 
First, most food webs have poor resolution of pri- 
mary producers; plants in terrestrial systems and 
phytoplankton in aquatic ones are typically repre- 
sented by a few nodes that are highly aggregated 
taxonomically. This is unfortunate, because their 
position at the base of a food web means that pri- 
mary producers are essential for understanding how 
the web is spatially organized, and how this spatial 
organization percolates up through the trophic re- 
lationships of other species. Fully resolved primary 
producers are also essential in examining how the 
structure of other trophic levels cuts across their 
spatial distribution, and, in so doing, couples dif- 
ferent habitats. 

Second, some technical problems have hindered 
the use of probabilistic models in analyzing group 
structure. Early food web models served as null 
models for food web structure and were tested by 
generating model webs and comparing summary 
statistics against data from real webs 12 13 . More 



recently, a more rigorous approach for measuring 
the goodness of fit of a model has been provided 
by maximum likelihood and model selection [8 14 



two problems still remain within this framework. 
One is technical: standard model-selection criteria 
are not applicable to "discrete parameters" such as 
group membership. The second problem is more 
fundamental: there are many almost equally good 
arrangements, and it is desirable to extract infor- 
mation not just from a single best arrangement, but 
also from the rest of the ensemble. 

The Bayesian approach is gaining popularity 
in ecological modeling due to the philosophical 
and conceptual appeal of explicitly considering un- 
certainty in parameter estimation as well as its 
methodological flexibility [15]. This approach is es- 



pecially well-suited for handling uncertainty in com- 
plex food web models, and allows us to overcome 
the limitations of the previous implementation of 
the group model. In network inference, there are 
only a few examples of complete Bayesian models 



16017 and a few examples of MCMC for maximum- 



likelihood inference 18 19 , but Bayesian inference 
in phylogenetics has been long established (20 21 , 
and provides a clear methodological analogue. 

In this paper, we address the group structure 
of a newly assembled food web for the mammals 
and plants of the Serengeti grassland ecosystem of 
Tanzania; we use a new and novel approach to the 
identification of groups based on Bayesian infer- 
ence. We specifically ask whether the structure that 
emerges reflects the underlying spatial dimension, 
as delineated by the different plant communities 
that characterize different sub-habitats within the 
ecosystem, or whether it is determined by trophic 
dimensions in the form of species guilds that share 
functional roles. 

The Serengeti food web is emerging as the most 
highly resolved terrestrial web to date 22 . The 



Serengeti has been studied as an integrated ecosys- 
tem for almost five decades [23| - [25] , and because of 
widespread popular familiarity with the consumer- 
resource dynamics of lions, hyenas, wildebeest, ze- 
bra and grasses, it provides a strong intuitive test 
for probabilistic food web models. Most impor- 
tantly, at the primary producer level, the Serengeti 
food web includes a number of distinct grass and 
woodland plant communities on different soils and 
across a rainfall gradient 26 . The sequential and 



well-documented changes in the underlying plant 
diversity allows us to examine the extent to which 
grassland communities define network topology at 
higher trophic levels. The high resolution of the 
species that comprise the mammal and plant com- 
munities of the ecosystem allow us to address the 
role of space in the group structure of the food web. 

From a complex network of only nodes and links 
that represent species and their interactions, the 
groups that emerge from an otherwise blind classi- 
fication of species make remarkable biological sense. 
We find that the group structure identified in the 
Serengeti food web represents a mixture of trophic 
guild structure and spatial patterns, an arrange- 
ment that differs significantly from the compart- 
mcntal structure typically identified in ecological 
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networks. 

Results /Discussion 

Bayesian Inference and Model Selec- 
tion for Food Webs 

Probabilistic Models for Food Webs 

In this paper, we use probabilistic modeling as 
a tool for formalizing hypotheses about food web 
structure. We treat a food web, an observed net- 
work of who eats whom in an ecosystem, as data. 
We start with the basic question: assuming a prob- 
abilistic model of food web structure, what is the 
probability of observing this particular real-world 
food web? This probability is referred to as the 
likelihood of observing the data, given model pa- 
rameters. In a maximum-likelihood framework, the 
mechanical part of the inference process is to find 
the set of model parameters that makes the likeli- 
hood as great as possible, with the interpretation 
that this represents the best point estimate of the 
underlying process. 

We begin with the group model of Allesina 
and Pascual [8], which was originally treated in a 
maximum-likelihood framework. Conceptually, this 
model encodes the simple hypothesis that species 
can be divided into groups, and species in the same 
group have statistically similar behavior: they tend 
to consume species in certain groups and tend to 
be consumed by species in certain groups. Specif- 
ically, the probability that a species belonging to 
group i is eaten by a species belonging to group j 
is given by p^ , and conversely, the probability of a 
link being absent is (1 —pij). If there are K groups, 
then a matrix P of K 2 link probabilities is required 
to completely describe the relationships among all 
groups. The likelihood for the whole network is 
the product over all pairs of species of the proba- 
bility of a link being present (if present) or absent 
(if absent). In the statistical literature, this model 
structure is known as a stochastic block model 27 



The assignment of species to groups is also an unob- 
served parameter in this model, which adds a layer 
of difficulty to parameter estimation. For example, 
in a network of 100 species, there are approximately 
5 x 10 115 different ways to partition the network into 
groups (see Methods). That is, if you had a com- 
puter that could process 10 80 partitions (as many 



partitions as there are atoms in the universe) every 
femtosecond (10 -15 s), it would take 1.5x 10 13 years 
to process them all. (By comparison, the universe 
is only 1.4 x 10 10 years old.) 

The group model allows for a more flexible def- 
inition of groups than standard approaches to net- 
work clustering, which find groups that have large 
numbers of internal connections and relatively few 
connections between groups p) . Because each p t j 
parameter may take any value between and 1, 
good model fits may result from other relationships, 
such as high link density between groups and low 
link density within groups, and may accommodate 
different relationships in different parts of the net- 
work. In general, the best-fitting partitions will try 
to maximize or minimize the number of links within 
specific groups and between specific pairs of groups. 



Bayesian Inference and Priors for the Group 
Model 

In a Bayesian framework, rules of probability are 
taken to govern both the data and model parame- 
ters. Rather than finding the set of parameter val- 
ues that maximize the likelihood, the goal becomes 
to estimate a probability distribution over param- 
eters based on observed data. In this way, we can 
directly quantify the uncertainty in our parameters 
in terms of probabilities. This permits questions 
such as: what is the probability that a parameter 
lies in a particular range? The name "Bayesian" 
comes from Bayes' rule, which tells us how to use 
conditional probability statements to infer a poste- 
rior distribution, in this case, the probability distri- 
bution over parameter values conditional on having 
observed the data, Pi(8\D). If we are dealing en- 
tirely with discrete probability distributions, Bayes' 
rule takes its most intuitive form: 



Pt(6\D) = 



Pr(0)Pr(D|0) 
Pr(L>) 



(1) 



The numerator of the right-hand-side is the prob- 
ability of producing the data from the given pa- 
rameters: the prior probability of those parame- 
ters, Pr(0), times the probability of producing the 
data given those parameters, Pr(D\9), the likeli- 
hood. The denominator is the marginal probability 
of observing the data unconditional on the partic- 
ular parameter values at play, which is simply the 
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sum of the probabilities of all the different ways of 
producing the data using all possible parameter val- 
ues, Pr(D) = X) e Pr(0)Pr(.D|0). In other words, 
in order to calculate the posterior probability of 
parameters 9, we add up all the different ways of 
producing the data weighted by their probability, 
and then calculate what fraction of that probability 
came from parameters 9. From here, we will write 
these quantities in more general notation, suitable 
for a mix of discrete and continuous probability dis- 
tributions: 



f(0\D) 



f(0)f(D\0) 
J g f(9)f(D\9)d9 



(2) 



where the integral sign represents a multiple inte- 
gral over discrete and continuous parameters. 

In the Bayesian framework, the model includes 
not only the formulation of the likelihood but also 
a prior distribution over parameters. With the 
group model, this means defining a prior distribu- 
tion over both link probabilities and arrangements 
into groups ("partitions"). In general, priors may 
incorporate informed knowledge about the system, 
but in this case we simply use them to encode dif- 
ferent variants of the same basic model. We use two 
distributions for partitions and two distributions for 
link probabilities, which are combined to form four 
different model variants. 

The two alternative distributions for elements 
Pij of the link probability matrix P arc (1) a uni- 
form distribution between and 1, and (2) a beta 
distribution with shape parameters a and /3, which 
are in turn governed by exponential distributions 
with mean 1. With a and j3 fixed at their means, 
alternative (2) reduces to a uniform distribution; at 
other values, the distribution may take a uniform, 
convex, concave, or skewed shape. Alternative (2) 
is thus structured hierarchically, with exponential 
hyperpriors for a and /? governing the beta prior 
for elements of P. 

For partitions, we consider (I) a uniform dis- 
tribution and (2) a distribution generated by the 
Dirichlet process, sometimes referred to as the "Chi- 
nese restaurant process" 28 . Alternative (2) is con- 



trolled by an aggregation parameter x that is in 
turn drawn from an exponential distribution with 
mean 1. The uniform distribution assigns equal 
prior probability to each possible partition, irre- 
spective of the number of groups. Because there 



are far more ways to partition the network at an in- 
termediate, but relatively high, number of groups, 
the uniform prior implicitly biases the model to- 
ward that number. For example, in the Serengeti 
food web, there are 170 nodes, yielding an a pri- 
ori expectation of 43.8 groups. In contrast, the hi- 
erarchically structured Dirichlet process prior pro- 
vides flexibility via the aggregation parameter x- 
When x is large, partitions tend to have many small 
groups; when x is small, partitions tend to have 
fewer groups, with a skewed group size distribution. 

We also consider two simple models without 
groups as null comparisons: (1) a directed random 
graph model (i.e., one group) with a uniform prior 
on a single link probability parameter p, and (2) a 
fully parameterized model, with each species in its 
own group (and a 170 x 170 link probability param- 
eter matrix P, also with a uniform link probability 
prior. 

For a fuller discussion of models and prior distri- 
butions, in particular the properties of the distribu- 
tion generated by the Dirichlet process, see Meth- 
ods. 

Bayesian Model Selection via Marginal Like- 
lihood 

The Bayesian framework provides a natural way to 
make probabilistic inferences based on a particular 
model. However, we also want to be able to choose 
between different models by quantifying their rel- 
ative goodness of fit. One approach to Bayesian 
model selection can be framed directly in terms of 
Bayes' rule, mirroring the process for estimating the 
posterior distribution over parameters for a single 
model. 

Consider two models, Mi and M2, to which we 
assign prior weight Pr(Mi) and Pr(-Ma). After the 
data has been observed, we can calculate the pos- 
terior probability of the models using Bayes' rule: 



Pr(Mi|D) 



Pr(M 2 |C) 



Pr(Mi)Pr(L>|Mi) 
Pr(£>) ' 



Pr(M 2 )Pr(£>|M 2 ) 



Pr(D) 



(3) 



(4) 



where the denominator is equal to the prob- 
ability of observing the data unconditional 
of the particular model at play, Pr(D) = 
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Pt(M 1 )P(D\M 1 ) + Pv{M 2 )P{D\M 2 ). The prob- 
abilities PrplMO = J 9i f(6 1 )f{D\e 1 )d9 1 and 
Pr(L>|Af 2 ) = fg 2 f{0 2 )f{D\0 2 )d6 2 are the marginal 
likelihoods of the two models, corresponding to the 
denominator in Equation[2j If we give the two mod- 
els equal prior weight, then the relative posterior 
weight of the two models is simply given by the 
marginal likelihoods. This reasoning extends natu- 
rally to any number of models. 

The ratio of the marginal likelihoods is often 



called the Bayes factor 29-31 , and is equal to the 



posterior odds ratio of the two models, assuming 
equal prior weight: 



B12 



Pr(L>|Afi 
Pi(D\M 2 



(5) 



The Bayes factor provides a convenient way to com- 
pare models: if B\ 2 — 10, then we consider support 
for model Mi to be ten times stronger than model 
M 2 . In AlC-based model selection, the Bayes factor 



is analogous to a ratio of Akaike weights 32 . 



Consensus Partitions 

The output of an MCMC simulation includes a 
long sequence of network partitions representing 
draws from the posterior distribution over parti- 
tions. As these partitions are potentially all distinct 
from each other, but represent similar tendencies of 
species to be grouped together, it is useful to try 
to summarize the information contained in all the 
samples in a more compact form. One approach is 
to construct a pairwise group-membership matrix 
for species in the food web, with entries equal to 
the posterior probability that two species are in the 
same group. A visual representation of this matrix 
can illuminate the group structure, and a consensus 
partition can then be constructed from this matrix 
using a simple clustering algorithm. (For more de- 
tails, see Methods.) 

Bayesian analysis of the Serengeti 
food web 

The Serengeti Data Set 

We compiled the food web from published accounts 



of feeding links in the literature [26 33 - 45 
The 



feeding links among S — 170 species (136 plants, 25 
herbivores, and 9 carnivores). 550 of the links are 
herbivorous, and 117 are predatory. The fraction 
of all possible links or connectance (C = L/S 2 ), ig- 
noring all biological constraints, is equal to 0.0231. 



Performance of Model Variants 

We find unequivocal support for the use of group- 
based models in describing the Serengeti food web. 
Models with group structure have vastly greater 
marginal likelihoods than simple null models that 
ignore group structure (Table [lj. Furthermore, the 
use of both the Dirichlet process prior for network 
partitions and the beta prior for link probability pa- 
rameters vastly improved the fit of the basic model. 
The best model variant as measured by marginal 
likelihood included both the beta prior on link prob- 
abilities and the Dirichlet process prior on parti- 
tions. The next best variant included a uniform 
partition prior and beta link probability prior, fol- 
lowed by the variant with Dirichlet process prior 
and uniform link probability prior. The strongest 
variant surpassed its closest competitor by 106 log- 
orders in likelihood and surpassed the model with 
two uniform priors by 470 log-orders in likelihood, 
providing unequivocal support for including both 
flexible priors in the model specification. Accord- 
ingly, in the remaining analysis we consider only the 
best model variant. 



Identification of Model Parameters 

The posterior mean number of groups K is 14.9 
(95% credible interval 12, 18), and the mean value 
of the Dirichlet process parameter \ is 3.2 (1.6, 5.3) 
(Figure [2]). The prior expectation of x was 1-0 an d 
the prior expectation of K was 5.4. The finding of 
posterior values substantially greater than prior val- 
ues strongly supports the presence of detailed group 
structure in the Serengeti food web. 

Mean values for beta distribution parameters 
are a = 0.046 (0.029,0.068) and (i = 0.89 
(0.50,1.43) (Figure |S2|). The corresponding beta 
prior has support concentrated near 0, since most 



compiled Serengeti food web (Ta- 
bles SI and S2 and Figure [lj consists of L = 667 



species do not feed on most other species (Figure 
Sll. 
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Groups Identified in the Serengeti Food Web 

The structure of the Serengeti food web is best 
represented by groups containing trophically simi- 
lar species, subdivided by specialization on different 
components of prey species, with plant species cor- 
responding to spatially distinct habitats. The best 
consensus partition, with 16 groups, is shown in Ta- 
ble^ There are 6 groups of plant species (groups 
11-16), 7 groups of herbivore species (groups 4- 
10) and 3 groups of carnivores species (groups 1- 
3). On average, plant groups contain more species 
than herbivore and carnivore groups (22.7, 3.6 and 
3.0, respectively). As evident in the pairwise group- 
membership matrix (Figure [3]), the carnivore and 
herbivore groups are well-defined, including several 
individual species or pairs of species with distinct 
diets. Plant groups demonstrate mild overlap, indi- 
cating a partially hierarchical relationship between 
smaller groups and larger groups. 

Figures |4j |6j and [5] show three alternate views 
of the food web, organized by the 16-group consen- 
sus partition. Groups 11-13, 14-15, and 16 con- 
sist of plants located in grassland, woodland, and 
kopje habitat, respectively. Group 11 includes short 
grasses that are grazed by a wide variety of species, 
distinguished from group 12, which includes short 
grasses fed on only by the large grazers (4). The 1- 
species groups (2, golden jackal Canis aureus; 6, 
African buffalo Syncerus caffer; 8, agama lizard 
Agama planiceps; 10, elephant Loxodenta africana; 
and 13, Microchloa kunthii) represent individual 
species with distinct diets or consumers. 

Plant groups are coupled by groups of herbi- 
vores, which are in turn coupled by groups of car- 
nivores. Large migratory grazers (4, wildebeest, ze- 
bra, and gazelles) feed primarily on plant groups 
consisting of grasses (11 and 12), and are the ex- 
clusive consumers of a group of grasses adapted to 
seasonal rainfall on sandy or volcanic soils (12) that 
dominate the short-grass plains in the southern part 
of the ecosystem. Herbivores feeding in the longer 
grasslands and woodlands and in riparian habitats 
(group 5) couple the first grass group (11) with 
woodland plants (group 14), which are also con- 
sumed to a lesser extent by the large grazers. The 
hyraxes (group 7) and group 9 (giraffe, olive ba- 
boon, and dik-dik) couple kopje habitat (group 16) 
with both woodland and grassland plants. At the 
highest trophic level, the large carnivores (1) inte- 



grate across all the herbivore groups; smaller carni- 
vores (2, 3) show more specialized diets, reflecting 
the more distinct habitats in which they are usually 
found. 



Discussion 

Spatial Guilds in the Serengeti Food Web 

In order to analyze the group structure of the 
Serengeti food web, we used a flexible Bayesian 
model of network structure that includes no biolog- 
ical information aside from a set of nodes represent- 
ing species and links representing their interactions. 
The groups that emerge from an otherwise blind 
classification of species make remarkable biological 
sense. Species are divided into trophic guilds that 
reveal a clear relationship between the spatial orga- 
nization of plant, herbivore, and carnivore groups 
and the structure of the network. At the coarsest 
scale, the groups in the Serengeti food web corre- 
spond to carnivores, herbivores, and plants. The 
further subdivisions that emerge within carnivores, 
herbivores, and plants reveal a spatial dimension 
to feeding structure that is only evident because 
of high species resolution at the plant level. Ulti- 
mately, the group structure we have derived mir- 
rors the flow of energy up the food web from dif- 
ferent spatial locations, with herbivores integrat- 
ing spatially separated groups of plants, and car- 
nivores integrating spatially widespread herbivores. 
Although the addition of birds, reptiles, inverte- 
brates, and pathogens will certainly add a signifi- 
cant number of new groups, we do not expect them 
to significantly modify the derived structure for the 
mammal and plant community. Nor will they mod- 
ify the larger tendency for groups to be assembled in 
ways that reflect the underlying spatial and trophic 
structure of the species in the web. 

Recently, interesting theoretical and empirical 
work has highlighted the relationship between ob- 
served patterns of food-web structure and energy 
flow that seemingly mirrors the trophic guild struc- 
ture in the Serengeti 
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Rooney and colleagues 
give evidence that real ecosystems may be domi- 
nated by nested sets of fast and slow "energy chan- 
nels," each of which represents a food chain of 
trophic guilds. They suggest that this pattern may 
have a strong stabilizing effect, based on theoreti- 
cal work by McCann on spatially coupled food webs 
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47 . The group structure for the Serengeti web that 
emerges from our analysis supports a pattern of spa- 
tial coupling at multiple trophic levels: the grass- 
lands have very high turnover rates compared to 
those of the kopjes and woodlands. This suggests a 
similar pattern of fast and slow energy channels to 
those described by Rooney and colleagues, with fast 
energy flow up through the highly seasonal but very 
productive grasses of the short-grass plains. These 
arc almost completely consumed by wildebeest and 
zebra during their peak calving season, which are 
then in turn consumed by large predators (lions and 
hyenas). In contrast, the resident herbivore species 
living on kopjes or in the woodlands reproduce at 
slower rates and are consumed less frequently by 
large carnivores, except during the time when they 
are unable to feed on migratory wildebeest and ze- 
bra. The group model and the inference approach 
presented here allow the examination of the dynam- 
ical consequences of this type of structure to be 
fully rooted in an empirical pattern, complementing 
more theoretical considerations of its central impor- 
tance for preserving biodiversity [47] . 

These patterns emerge directly from the topol- 
ogy of the food web without being explicitly labeled 
as different habitats upfront as was done in previous 
empirical work 
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showing that topological anal- 
ysis can reveal structures that may be very signifi- 
cant for food-web dynamics. They are subtly differ- 
ent, however, from the proposed pure fast and slow 
chains, in that they incorporate the migration of 
the keystone species in the ecosystem, so the fastest 
energy chain is seasonally ephemeral and may only 
operate for three to four months in any year. We 
suspect that even within the sub-habitats of kopjes 
and woodlands there are similarly nested faster and 
slower chains that involve species for which we are 
still collating data (e.g., birds, small mammals, and 
insects). 

Bayesian Analysis of Food- Web Structure 

In this paper, we used a probabilistic model to ana- 
lyze the structure of a single food web, an approach 
we have seen in only one other study based on a 
probabilistic version of the niche model 



19 . This 



approach has proved fruitful in Bayesian phyloge- 
netics, where the combinatorial challenges are sim- 
ilar. Moreover, we view the group model as only 
a starting point for richer modeling efforts to help 



identify relevant processes that influence the struc- 
ture of ecological communities. 

In fact, the Bayesian approach described here 
provides a powerful general framework for encod- 
ing hypotheses about the structure of food webs and 
comparing models against each other, and we see it 
as a natural next step in the current trend of repre- 
senting food-web models in a common way. Simple 
abstract models such as the niche model and the 
group model used here act as proxies for the high- 
dimensional trait space that determines feeding re- 
lationships in an ecosystem. The identification of 
actual traits that correspond to groups (or niche 
dimensions) is another valuable direction, so far 
followed primarily by finding correlations between 



compartments/groups 48 or niche values 13 and 



traits such as body size or phylogenetic relatedness. 
Another approach is to directly incorporate these 
traits into the probabilistic models, either as model 
predictors or as informed priors. Both kinds of 
analyses are valuable, but the second kind becomes 
more approachable in a general Bayesian modeling 
framework. 

The use of flexible hierarchical priors for model 
parameters is one straightforward innovation pos- 
sible in the Bayesian framework. The number of 
groups identified by the model increases dramat- 
ically with the use of a flexible beta prior distri- 
bution for link probability parameters. In that 
model variant, we effectively introduce two degrees 
of freedom to the model (the beta distribution pa- 
rameters) but dramatically reduce the effective de- 
grees of freedom of the link probability parameters. 
Note that we properly penalize parameters by using 
the marginal likelihood for model selection, so that 
the model selection rep- resents a balance between 
goodness of fit and model complexity. Moreover, 
this structure makes intuitive sense: since most link 
probability parameters are simply zero, they should 
not be penalized. An alternate approach is to re- 
move and add parameters to the model, but this 
hierarchical technique is much easier to implement 
in practice. 

Advanced Markov-chain Monte Carlo methods 
make it possible to accurately estimate marginal 
likelihoods for probabilistic network models. Un- 
like information criteria such as AIC, BIC, or DIC, 
an accurate estimate of the marginal likelihood pro- 
vides a direct measurement of goodness of fit that 
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takes into account the degrees of freedom in a model 
without making any asymptotic assumptions about 
parameter distributions [49], and can handle dis- 
crete parameters such as partitioning into groups 
that are not properly handled by AIC and BIC. 

The Bayesian approach also serves as a means 
to avoid fundamental issues inherent in network 
models with a large parameter space. In a re- 
cent study, Good and colleagues [50] examined the 
properties of module-finding in networks using the 
pervasive modularity-maximization approach |51| , 
finding that even in relatively small networks a large 
number of good solutions exist. A maximization 
algorithm is thus guaranteed to find a single local 
maximum of many — possibly even the best one, but 
certainly not one that captures the full range of 
good solutions. This problem arises whether the 
quantity to be maximized is a heuristic such as 
modularity or a likelihood value. The group model 
and other parameter-rich models presumably suffer 
from similar degeneracy problems. In the present 
case, we find that every partition sampled from 
the posterior distribution for the best-fitting group 
model variant is unique. Although MCMC sam- 
pling cannot reproduce the full posterior distribu- 
tion, it is an important step in the right direction. 
Philosophical arguments aside, one of the main rea- 
sons for maximizing likelihood or modularity is sim- 
ply that a single solution is far more tractable than 
a distribution. The consensus partitioning heuris- 



tic used here is one attempt at recovering a simpler 
object of study (see Methods); more sophisticated 
approaches will be welcomed. 



The group model, based on the simple notion 
that groups of species may have similar feeding re- 
lationships to other groups, reveals that trophic 
guilds are the topologically dominant type of group 
in the Serengeti food web. The model also reveals 
an interesting relationship between spatial struc- 
ture and network structure that corroborates recent 
ideas on spatial coupling in food webs. A theoret- 
ical study with a dynamical model suggests that 
this type of structure may contribute to 'stabil- 
ity' in the sense of the persistence of species 
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We are now in a position to examine different as- 
pects of stability, including robustness to secondary 
extinctions, based on structures directly inferred 
from empirical networks. Although the Bayesian 
modeling approach is not new to network analysis 
in general 
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it remains relatively rare. The 
Bayesian group model, and, more importantly, the 
general framework for modeling and model selec- 
tion, naturally extend to other kinds of biological 
networks, such as metabolic and regulatory net- 
works [52| and networks describing other ecological 
interactions such as pollination 53 . We advocate 



this framework as a way to build stronger ties be- 
tween hypothesis formulation, model building, and 
data analysis. 



Methods 
Group Model 

We use as a starting point the group model of Allesina and Pascual [8] , in which a network of N nodes is 
partitioned into K groups. The groups to which a potential prey and to which a potential predator belong 
completely determine the probability that a feeding relationship exists between them. The assignment 
of species to groups is given by the vector G = (gi, ■ ■ ■ ,g n ), with gi £ {1, . . . , K}. We refer to this 
assignment as a set 'partition,' in keeping with standard mathematical terminology. The probability that 
a species assigned to group i is consumed by a species assigned to group j is equal to py. This gives a 
matrix P of K 2 probabilities, containing the probabilities of observing directed links between members 
of each pair of groups, and within members of each group. 

If we take A to be the directed adjacency matrix of a network, with entries ay equal to 1 if a link 
exists from node i to node j, otherwise, then the probability of the network being generated by partition 
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G and link probabilities P is given by 

K K 

/(aig,p) nn^' 1 > ( 6 ) 

1=1 j=l 

where and Zij are the number of 1-entries and 0-entries in the submatrix of A containing entries from 
rows r satisfying g r = i and columns c satisfying g c = j. 

In the simplest case, all nodes are assigned to the same group, and the likelihood simplifies to 

f(A\p)=p Y (l-p) z (7) 

where Y and Z are the total number of 1-entries and 0-entries in A. 

In order to use the group model for Bayesian inference, we want to infer the posterior distribution 
over partitions and parameters, 

/(G,P|A)cx/(G,P)/(A|G,P). (8) 

This requires specifying a prior distribution over partitions G and link probabilities P. We consider two 
priors over G and two priors over P. 

Model Priors 
Priors for Partitions 

The simplest prior over partitions assigns equal probability to each possible assignment of nodes into 
groups. For a network of N nodes, the number of possible partitions is given by the Nth Bell number, 

N 

B(N)=Y / S 2 (N,K), (9) 

K=l 

where S2(N,K) is the Stirling number of the second kind, the number of ways to partition N objects 
into exactly K groups, 

^,K) = ^Y,{-l)^( K \f. (10) 
' j=0 v J / 

Therefore, the prior probability of a particular partition is uniform across all possible partitions 

/(G) = -Lj. (11) 

and the prior probability of having exactly K groups is 

S 2 {N,K) 

f[K) = -B{NT- (1) 

For partitions, the choice of a uniform prior, although simple, includes hidden assumptions. In particular, 
there are far more possible partitions for an intermediate number of groups than a small or large number, 
so the prior will implicitly bias results toward that number. For example, with 100 nodes, the distribution 
is peaked at K — 28 (Figure SI). 

An alternate prior for partitioning objects into groups comes from the Dirichlet process, also known 
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as the "Chinese restaurant process," which is becoming a standard Bayesian prior for related problems 



28 54 55 . Consider a restaurant with an infinitely large number of infinitely large tables, all initially 
empty. The first patron sits alone at the first table, and subsequent patrons may either sit at an occupied 
table or a new table. They choose occupied tables with weight equal to the number of current occupants, 
or a new table with weight equal to an aggregation parameter \- For example, the second patron will 
sit at the same table as the first patron with probability 1/(1 + \). In fact, because the process is 
exchangeable, the probability of any pair of patrons sitting at the table is also 1/(1 + x)- If X is small, 
there will tend to be a small number of occupied tables and a skewed distribution of tabic sizes; if \ is 
large, there will be a larger number of tables occupied by few patrons. 

Interpreting tables of patrons as groups of nodes, under the Dirichlet process the prior probability of 
a particular partition G is 

m\ X ) = x K ^!i V3 ~ 1) \ , d3) 

where N is the number of nodes in the network, K is the number of groups in the partition, and ijj is 
the number of nodes in group j. The prior probability of K groups is 

nm-m^.. (") 

Ili=i(x + i- 1) 

where Si(N,K) is a Stirling number of the first kind, equal to the coefficients on xk in the expansion 
x{x- l)(x-2)...(x- K + 1). 

Rather than choosing a fixed value of \ for the prior, we give x an exponential hyperprior distribution 
with mean 1: 

fix) = X> 0. (15) 

Priors for Link Probabilities 

Similarly, the elements of link probability matrix P may be given a simple uniform prior over [0, 1]: 

f(Pij) = 1 < jHj < 1. (16) 
As there may be some regularity in the values of the link probabilities, we also tried a beta prior: 



where B(a,f3) is the beta function, 



B(a,P)= [ t a - l {l-tf- l dt. (18) 
Jo 

The parameters a and /3 control the shape of the distribution, which may be convex, concave, or skewed 
toward or 1. When a = ft = 1, the beta prior becomes a uniform distribution. 
We use a and /? exponential hyperpriors with mean 1: 

f(a) = e- a a> 0, (19) 
f{fi) = e -P P> 0. (20) 
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Markov- chain Monte Carlo Sampling 

For networks of any appreciable size, the number of possible partitions is far too large to enumerate, so 
we must use a Markov-chain Monte Carlo (MCMC) technique to sample from the posterior distribution. 
Here we describe the sampling procedure and the details of the Metropolis-Hastings proposal distribution 
used. 



Sampling Procedure 

We employ the standard Metropolis-Hastings algorithm to sample from the posterior distribution over 



partitions and model hyperparameters 56 57 . The general idea of an MCMC method is to set up a 
sequence of dependent samples 0i,02c-< that is guaranteed to converge to a target distribution, in this 
case the posterior distribution of our model. Starting from the current sample, a change is proposed, 
drawn from a proposal distribution over possible changes, q(8 This sample is either rejected, in 

which case the current sample is repeated, or the proposed sample is accepted as the new sample. The 
Metropolis-Hastings acceptance probability, 



f{6) q(8 -> 6> 

guarantees that the sequence of samples will converge to the posterior distribution, f(0\D) oc f(8)f(D\8), 
the prior times the likelihood. 

For the group model, the samples 8 consist of hyperparameters for the model variant — the Dirichlet 
process prior parameter x an d the beta prior parameters a and /3 — as well as the group count K and 
assignment vector G. The link probabilities P governing links between groups are not included, because 
the likelihood function would not be compatible between partitions with different values of K. One 
possible solution to this problem would be to include P in the sampling procedure, restrict K to & 
particular number for a particular run, and then appropriately weight runs with different values of K. 



Another approach is reversible-jump MCMC 58 , which appropriately handles a mapping between two 
different parameter spaces as part of the Metropolis-Hastings proposal ratio. (We tried a reversible-jump 
scheme, but chains tended to get stuck at local maxima.) 

Instead of trying to sample values of P, we use the marginal likelihood of a partition given model 
hyperparameters — that is, the posterior distribution, conditional on values of a, j3, and G, integrated 
over all possible values of P — directly in the Metropolis-Hastings procedure. This is possible because the 
marginal likelihood of a single partition can be calculated analytically. 

For a beta prior over link probabilities, the likelihood of G, a and /3 marginalized over all possible 
values of P is 

/(A|G, a,p)=J f(P\a, /3)/(A|G, P) dP (21) 



K K „i 



1=1 3 

^rfr B(Y l] +a,Z lJ +f3) 
HH B(a,B) 

i=lj = l V ' ' ' 



(22) 
(23) 
(24) 



Similarly, for a uniform prior over link probabilities, the marginal likelihood of a particular partition is 
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simply 

K K 

/(AiG)=nn i? (^+ i '^+ 1 )- ( 25 ) 
»=i j=i 

Proposal Distribution 

For the case of uniform priors on partitions and link probabilities, the proposal distribution only allows 
changes to the partition. With the Dirichlct process prior on partitions and the beta prior on link 
probabilities, hyperparameters x, a , and (3 can also be changed. 
The proposal distribution is described as follows: 

1. Each hyperparameter h (a, (3, and x) is chosen for update with probability ph, where ph are tuned 
to improve convergence. A proposed new value b! is drawn from a uniform distribution between 
max(0, h — Th) and h + rh, where Th is a proposal radius manually tuned to improve convergence. 
(A scale-free proposal could easily be used instead, and may require less tuning.) 

2. With probability (1 — J2 h Ph), a group-change move is proposed: 

(a) A node i is chosen uniformly at random as the species to be moved. 

(b) Another node j =/= i is chosen uniformly at random. 

(c) If i and j are in different groups, node i is moved into the group of node j. If i and j are in 
the same group, node i is moved into a new group. 



Metropolis-coupled MCMC 

Although the Metropolis-Hastings algorithm is guaranteed to converge to the target distribution at some 
point, local maxima in the likelihood surface can cause a chain to become stuck for long periods of 
time. One approach to avoiding this problem, known as "Metropolis coupling," involves running multiple 
chains in parallel. One chain, the "cold chain," explores the target distribution, while the other chains, 
"hot chains," explore low-likelihood configurations more freely. Periodically, swaps are proposed between 
chains, allowing good configurations discovered on hot chains to propagate toward the cold chain. 
Rather than exploring the target distribution f(6\D) cx f(9)f(D\0), heated chains explore 

f T (e\D)<xf(0)\f(D\e)] T re [0,1], (26) 

where r is a heating parameter. We use linearly spaced values of r, with the hottest chain exploring the 
prior (r = 0) and the coldest chain exploring the posterior (r = 1). 

Swap moves are standard Metropolis-Hastings proposals, but rather than considering a change to a 
single chain, they consider a change to the joint distribution of two chains. The acceptance probability 
is thus the ratio of the joint distribution after and before the move: 

r((o o)^(o B ^- fJMMMiKilMMiPMi. ( o 7) 



f{D\6i) 



Km) 



(28) 



where 6i,9j are the configurations that begin in chains i and j, and Ti,Tj are the heat parameters of the 
two chains. 

The use of multiple heated chains has the side effect of drastically improving estimates of marginal 
likelihoods for model selection, as described in the next section. 
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Model Selection via Marginal Likelihood 

The marginal likelihood of a model is the likelihood averaged over the prior distribution. That is, it is 
the likelihood one would expect by randomly sampling parameters from the prior distribution: 

f(D\M)= [ f(8)f(D\9)d8. (29) 



This value serves as a useful measure of model fit because it directly incorporates the dependence of the 
likelihood on uncertainty in parameter values, implicitly penalizing extra degrees of freedom [49] - If an 
additional parameter improves the maximum likelihood but decreases the average likelihood, the model 
suffers from overfitting relative to the simpler model. 

Marginal Likelihood for the Group Model 

The marginal likelihood for the group model involves integrating over all hyperparameters, partitions, 
and link probabilities. For the model with uniform distributions over partitions and link probabilities, 
the marginal likelihood is 



/(A|M„, M ) = ^/(G)/(A|G) (30) 

(31) 



G 

K K 



y— 



.=1.7=1 



With a Dirichlet process prior over partitions and a uniform distribution over link probabilities, the 
marginal likelihood is similarly 

poo 

f(A\M d}U )= /( X )V/(G|x)/(A|G)d X . (32) 
Jo g 

Using a uniform prior over partitions and a beta prior over link probabilities yields 

POO POO 

f(A\M Utb ) = V/(G) / f(a) f(fi)f{A\G,a,p)dpda. (33) 
q Jo Jo 

Combining both gives 

POO POO POO 

f(A\M d , b ) = /(X)V/(G| X ) / f(a) f(P)f(A\G,a,(3)df3dad X . (34) 
Jo G Jo Jo 

Thermodynamic Integration for Marginal Likelihood Estimation 

As enumeration across all possible partitions is impossible for networks of any significant size, we would 
like to use MCMC to estimate the marginal likelihood for the sake of comparison among different models. 
Marginal likelihood estimates derived from a single chain, such as the harmonic mean estimator of Raftery 



31 , converge very slowly, because MCMC fails to sample sufficiently from low-likelihood areas. However, 
it is possible to use the information gathered about low-likelihood areas in heated chains using a technique 
called thermodynamic integration [59j[60], or path sampling [6l] . 

Assuming a continuum of heated chains, the thermodynamic estimator for the log-marginal likelihood 
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is 



log£(M) = 



„1 m 



)log£(^, T )dr 



(35) 



where m is the number of samples in the MCMC output, and 8 i T is a single sample from the output in 
a chain with heat parameter r 60 . With a finite number of chains, we estimate this integral using cubic 



spline interpolation as implemented in the splinefun function in the R software package 62 



Consensus Partitions 

The full output of an MCMC chain from the group model includes an extremely large number of different 
partitions, and, for the sake of interpretation, it is desirable to seek a consensus partition that does a 
reasonable job of summarizing the distribution. We use a simple, computationally inexpensive method to 
accomplish this task: in short, clustering the nodes in the network based on a pairwise group-membership 
matrix. 

The group-membership matrix M is the posterior probability that two nodes are in the same group 
and otherwise, that is, 

M = ^P(G|A)M G , (36) 

G 

where an entry Mq is 1 if nodes i and j are in the same group, that is, 

Mc, = 'k: xi , (37) 

where 5 is the Kronecker delta and G is the assignment vector for the partition. This matrix is estimated 
from MCMC output as the fraction of MCMC samples in which the corresponding species are in the 
same group: 

1 N 

M=-]TM Gs . (38) 

i=l 

A consensus partition is formed by applying a hierarchical clustering algorithm to the group- 
membership matrix estimate M, and then cutting the dendrogram at some number of groups K, forming 
a consensus partition with assignment vector Gk and group- membership matrix M^-. The goodness 
of fit of a consensus partition is simply measured as the correlation between M and M^-). The best 
consensus partition is thus identified using the value of K that gives the highest correlation. 

We use the aver age- linkage clustering algorithm |63| as implemented by the hclust function in the 
R software package 62 , treating 1 — M as distance matrix. We find that the average- linkage algorithm 
produces higher correlations than the other algorithms implemented as well as ideal K close to the mean 
K in the MCMC output. Furthermore, we find that consensus partitions produce higher correlations 
with the M than any individual partition in the MCMC output. 
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Figure 1. The Serengeti food web. The network is shown using a spring-layout algorithm without 
clustering. Plant, herbivore, and carnivore nodes are green, orange, and red, respectively. 



Supporting Information 

(See supporting file tab_Sl_species_list . csv for data.) 



Table SI. Species in the Serengeti food web. 
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Figure 2. Posterior distributions and prior expectations of aggregation parameter \ an d 
group count K. 

(See supporting file tab_S2_edge_list . csv for data.) 

Table S2. Feeding links in the Serengeti food web. 

(See supporting file tab_S3_consensus_partition. csv for data.) 

Table S3. 16-group consensus partition. 

(See supporting file tab_S4_link_density . csv for data.) 

Table S4. Link densities between groups in the 16-group consensus partition. 

Tables 



Table 1. Marginal likelihood estimates for model variants calculated via thermodynamic 
integration. 



Partition prior 


Link prior 


Log marginal likelihood estimate 


95% bootstrap confidence int. 


Uniform 


Uniform 


-1826.87 


(-1826.94,-1826.82) 


Uniform 


Beta 


-1463.22 


(-1463.39,-1463.03) 


Dirichlet process 


Uniform 


-1547.41 


(-1547.45,-1547.37) 


Dirichlct process 


Beta 


-1356.96 


(-1357.02,-1356.90) 


One group 


Uniform 


-2870.58 


(exact) 


170 groups 


Uniform 


-20031.95 


(exact) 


170 groups 


Beta 


-2870.58 


(exact) 
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Figure 3. Pairwise group membership matrix. Species are identically ordered top to bottom and 
left to right according to the 16-group consensus partition as listed in Table [2j Hue indicates group 
identity; color saturation indicates the fraction of partitions in which species occupy the same group. 
Note that this image conveys information about group membership, not network connectivity. 
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Figure 4. Adjacency matrix ordered by groups. Species are identically ordered top to bottom 
and left to right according to the 16-group consensus partition as listed in Table [2] Black matrix entries 
indicate that the species in the column feeds on the species in the row. Columns that would indicate 
prey of plant groups are omitted. Note that in a modular network according to the standard definition, 
links would be concentrated on the diagonal of the adjacency matrix, since they occur within groups. 
By contrast, here links are concentrated in off-diagonal blocks. 
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Figure 5. Network layout of groups. The network is shown organized and colored by group 
according to the 16-group consensus partition listed in Table [2| 



Figure 6. Network layout of aggregated groups. Nodes in the network are aggregated and 
colored by group according to the 16-group consensus partition listed in Table [2j and arranged 
vertically by trophic level. Line thickness indicates the link density between groups. Node area is 
proportional to the number of species in a group. 
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Figure SI. Implicit prior on number of groups with uniform partition prior. The prior 
distribution on number of groups K is shown for a uniform partition prior for a network with 100 
nodes. The mode of the distribution is at K — 28. 
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Figure S2. Posterior distributions of link density parameters a and [3. Color brightness 
indicates posterior density, estimated using the ks multivariate kernel density estimation package for 



R 64 . Contours indicate cumulative density. The a parameter is significantly lower than 1, indicating 



departure from a uniform distribution. 
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Figure S3. Distribution of link probability parameters. The prior distribution for link 
probability parameters, integrated over the priors for beta distribution parameters a and (3, is indicated 
with a dotted line. The heat map shows beta distributions corresponding to the posterior distribution 
for a and j3, with lightness indicating the posterior density of the parameter values. 
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Table 2. Groups identified in the Serengeti food web using a 16-group consensus partition. 



Group 1 
Group 2 
Group 3 
Group 4 

Group 5 



Group 6 
Group 7 
Group 8 
Group 9 
Group 10 
Group 11 



Group 12 

Group 13 
Group 14 



Group 15 
Group 16 



Crocuta crocuta, Lycaon pictus, Panthera leo, Panthera pardus, Acinonyx jubatus 
Canis aureus 

Canis mesomelas, Leptailurus serval, Caracal caracal 

Connochaetus taurinus, Gazella granti, Gazella thomsoni, Equus burchclli, Alcclaphus 
buselaphus, Aepyceros melampus, Damaliscus korrigum 

Kobus ellipsiprymnus, Phacochaerus aethiopicus, Tragelaphus scriptus, Ourebia ourebi, 
Redunca redunca, Pedetes capensis, Taurotragus oryx, Rhabdomys pumilio, Hippopota- 
mus amphibus, Cercopithccus acthiops 
Syncerus caffer 

Hcterohyrax brucei, Procavia capensis 
Agama planiceps 

Papio anubis, Giraffa camelopardalis, Madoqua kirkii 
Loxodenta africana 

Panicum coloratum, Sporobolus pyramidalis, Hyparrhcnia filipcndula, Harpachne schim- 
peri, Digitaria macroblcphara, Eragrostis tenuifolia, Grewia bicolor, Aristida adoensis, 
Brachiaria semiundulata, Pennisetum mezianum, Bothriochloa insculpta, Panicum max- 
imum, Sida spp., Eustachys paspaloides, Croton macrostachyus, Solanum incanum, In- 
digofcra hochstetteri, Hibiscus spp., Hctcropogon contortus, Cynodon dactylon, Themeda 
triandra, Balanites aegytiaca 

Digitaria scalarum/abysinnica, Dinebra retroflexa, Ischacmum afrum, Eragostris cilianen- 
sis, Hyparrhenia rufa, Sporobolus fimbriatus, Sporobolus spicatus 
Microchloa kunthii 

Echinochloa haploclada, Digitaria milanjiana, Panicum deustum, Digitaria ternata, An- 
dropogon schircnsis, Cymbopogon excavatus, Setaria sphacelata, Typha capensis, Se- 
taria pallidifusca, Phragmites mauritianus, Eragrostis exaspcrata, Andropogon green- 
wayi, Lonchocarpus eriocalyx, Sporobolus centrifugus, Hyparrhenia dissoluta, Chloris 
roxburghiana, Aristidia hordacea, Chloris pycnothrix, Panicum repens, Aristidia kenyen- 
sis, Combretum molle, Acacia xanthophloea, Disperma kilimandscharica, Vossia cusp- 
ida, Odyssea jaegeri, Sporobolus ioclados, Euphorbia candelabrum, Sorghum versicolor, 
Kigclia africana, Olea spp., Sporobolus festivus, Acacia pallens, Crotalaria spinosa, Digi- 
taria diagonalis, Boscia augustifolia, Acacia robusta, Acacia seyal/hockii, Chloris gayana, 
Pennisetum stramineum, Commiphora africana/ trothae 
Acacia Senegal, Acacia tortilis 

Grewia fallax, Cissus quadrangularis, Cissus rotundifolia, Commelina africana, Allophy- 
lus rubifolus, Sensevieria ehrenbergiana, Pavetta assimilis, Phyllanthus sepialis, Acalypha 
fructicosa, Maerua triphllya, Ficus glumosa, Croton dichogamus, Sclerocarya birrea, Cap- 
paris tomentosa, Ximenia caffra, Cordia ovalis, Grewia trichocarpa, Abutilon angulatum, 
Pappaea capensis, Commiphora schimperi, Albuca spp., Ficus ingens, Hoslundia opposita, 
Ocinum suave, Cenchrus ciliaris, Solanum dennekense, Aloe macrosiphon, Indigofera basi- 
fiora, Ipomoea obscura, Albizia harveyi, Ficus thinningii, Emilia coccinea, Cyphostemma 
nierensis, Spirocarpa spp., Sensevieria suffruticosa, Pupalia lappacea, Aloe secundiflora, 
Turreae fischeri, Pavonia patens, Jasminum fluminense, Acacia clavigera, Cassia didy- 
mobotrya, Kedrotis foetidissima, Hypoestes forskalii, Zisiphus mucronata, Commiphora 
mcrkcri, Blcpharis acanthoides, Iboza sp., Rhoicissus revoilii, Kalanchoe sp., Solanum 
nigrum, Achyranthes aspera, Digitaria velutina, Tricholaena eichingeri, Lippia ukamben- 
sis, Hcliotropium stcudncri, Kyllinga nervosa, Sporobolus stapfianus, Cyperus kilimand- 
scharica, Pellaea calomelanos, Sporobolus pellucidus, Eragrostis aspera, Eriochloa nubica, 
Dihetcropogon amplectus 



